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ABSTRACT 

We develop a template-fit method to automatically identify and classify late-type K and M dwarfs in spectra 
from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST). A search of the commis¬ 
sioning data, acquired in 2009-2010, yields the identification of 2,612 late-K and M dwarfs. The template 
fit method also provides spectral classification to half a subtype, classifies the stars along the dwarf-subdwarf 
(dM/sdM/esdM/usdM) metallicity sequence, and provides improved metallicity/gravity information on a finer 
scale. The automated search and classification is performed using a set of cool star templates assembled from 
the Sloan Digital Sky Survey spectroscopic database. We show that the stars can be efficiently classified de¬ 
spite shortcomings in the LAMOST commissioning data which include bright sky lines in the red. In particular 
we find that the absolute and relative strengths of the critical TiO and CaH molecular bands around 7000Aare 
cleanly measured, which provides accurate spectral typing from late-K to mid-M, and makes it possible to 
estimate metallicity classes in a way that is more efficient and reliable than with the use of spectral indices or 
spectral-index based parameters such as Cno/CaH- Most of the cool dwarfs observed by LAMOST are found to 
be metal-rich dwarfs (dM). However, we identify 52 metal-poor M subdwarfs (sdM), 5 very metal-poor extreme 
subdwarfs (esdM) and 1 probable ultra metal-poor subdwarf (usdM). We use a calibration of spectral type to 
absolute magnitude and estimate spectroscopic distances for all the stars; we also recover proper motions from 
the SUPERBLINK and PPMXL catalogs. Our analysis of the estimated transverse motions suggests a mean 
velocity and standard deviation for the UVW components of velocity to be: <U>=-9.8 km/s, ay=35.6 km/s; 
<V>=-22.8 km/s,a v -=5 0.6 km/s; <W>=-7.9 km/s, a w =2HA km/s. The resulting values are general agreement 
with previous reported results, which yields confidence in our spectral classification and spectroscopic distance 
estimates, and illustrates the potential for using LAMOST spectra of K and M dwarfs for investigating the 
chemo-kinematics of the local Galactic disk and halo. 

Subject headings: Stars: low-mass — Stars: kinematics and dynamics — Methods: data analysis — Surveys 


1. INTRODUCTION 

M dwarfs are the dominant class of stars in the Galaxy, and 
comprise ~70% of all hydrogen-burning object s dReid et al.1 
120021 ICovev et~afl l2008t iBochanski et al.1120101) . Metal-rich 
M dwarfs and their metal-poor counterparts, the M subdwarfs, 
hold great potential for mapping out the baryonic mass and 
uncovering the dynamical structure of the Galactic disk and 
halo. Their spectral energy distribution is also very sensitive 
to metallicity variations, which can potentially be used to map 
out th e chemical evolution of the Galaxy (Bochanski et al. 
l2013i ). However, because M dwarfs are relatively dim stars, 
with absolute visual magnitudes 8<My <15 (jLepineil2005 ). 
and because current large-scale spectroscopic surveys are ef¬ 
ficient only for stars with magnitudes V < 20, the effective 
distance range over which M dwarfs can be studied in very 
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large numbers only extends to ~ 1—2 Kpc. 

Large multi-object spectroscopic surveys such as the 
Sloan Extension for Galactic Understanding and Evolu¬ 
tion (SEGUE, lYannv et~akl 120091) have made considerable 
progress in uncovering the large-scale structure of the Galaxy. 
This was achieved by targeting specific tracers of dis- 
tant_stellar populations such as main-sequence turn-off stars 
dNewberg et al.ll 12002 ). Chemical abundance and kinematic 
surveys were also performed by targeting main sequence stars 
of intermediate mass such as F and G stars dLee et al.ll201 ll) . 
A spectroscopic catalog of M dwarfs identified in the Sloan 
Digital Sky Survey (SPSS. [York et al .1120001). and comprising 
more than 70,000 stars, was presented by lWest et al.l (1201 ll) . 
which showed that the metallicity of early type M dwarfs de¬ 
creases as a function of vertical distance from Galactic plane. 

As in SDSS/SEGUE, M dwarfs also make excellent targets 
for large jnulti-object spectroscopic surveys such as LAM¬ 
OST dCui et al.l 120121) . This is because M dwarfs are very 
abundant in every direction on the sky, and especially at low 
Galactic latitudes where extra-galactic and Galactic halo sur¬ 
veys have low target densities. They can thus provide abun¬ 
dant targets for dedicated stellar surveys, and can also be used 
as filler targets for other programs. 

Since large numbers of M dwarf spectra are therefore ex¬ 
pected to be produced in the LAMOST survey, it is impor¬ 
tant to develop automated and reliable tools to identify them 
and provide basic physical properties such as temperature and 
metallicity. We are currently developing a spectroscopic anal- 
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ysis pipeline to classify M dwarfs and the more metal-poor 
M subdwarfs. Our goal is to provide fine-scale spectral sub- 
types and molecular band ratios in order to place the stars on 
a temperature-metallicity grid. This paper outlines the basic 
method we have adopted, which is based on the use of high 
signal-to-noise classification templates assembled from pre¬ 
vious spectroscopic surveys. To test the analysis pipeline, we 
have searched for M dwarfs/subdwarfs in the LAMOST com¬ 
missioning data, using M dwarf/subdwarf templates assem¬ 
bled from the SDSS spectroscopic survey. We present here 
a catalog of 2,612 M dwarfs and M subdwarfs identified us¬ 
ing our automated, template-fit method, and perform simple 
distance and kinematics analysis which highlights the poten¬ 
tial of LAMOST for the study of low-mass stars and Galactic 
dynamics. 

2. THE LAMOST COMMISSIONING DATA 

The Large sky Area Multi-Object fiber Spectroscopic Tele¬ 
scope (LAMOST), also named the Guo Shou Jing Telescope, 
is a quasi-meridian reflecting Schmidt telescope which pro¬ 
vides a field of view of up to 20 square degrees, over which up 
to 4,000 optical fibers can be automatically positioned, feed¬ 
ing light to 16 multi-object spectrographs. Because of the 
large aperture optics design, the telescope can produce 4,000 
spectra in a single exposure, and is designed to reach a lim¬ 
iting magnitude of about r=19 in 90 minutes exposu res, for 
spectra with a resolution R=1800 ( IZhao et al. l 120121) . With 
an average expected rate of 6 exposures per night, LAMOST 
will generate millions of spectra from stars and galaxies every 
year, making it a powerful survey telescope. 

A commissioning survey was implemented to test the ca¬ 
pabilities of the LAMOST telescope and verify its ability to 
target and observe 4000 stars at a time. To test the operation 
in survey mode, 6 fields have been selected at moderately low 
Galactic latitudes, which provide an abundance of bright tar¬ 
gets. The fields are distributed near the Galactic anti-center, 
with 119° < gl < 226° and -25° < gb < +37°. Each of the 
field was observed 4-16 times in the course of the commis¬ 
sioning survey, each time targeting a different set of 4,000 
stars in each field. 

In this testing phase, however, only about 3500 of the 4000 
fibers were available for science targets. Some fibers were 
offline or could not be moved at the time for a number of 
reasons, while other fibers were trained on patches of back¬ 
ground sky to measure local sky brightness, and evaluate the 
contamination from sky lines from both natural and artificial 
sources. The 3500 fibers available for science targets were 
used on different objects in every exposure, as much as possi¬ 
ble, although a few stars ended up being observed twice. 

In the end, a total of 165,219 spectra from science targets 
were collected from the 48 field exposures. However, only 
a small fraction of the spectra were found to have sufficient 
signal-to-noise ratio to identify them as stars. Other spectra 
were dominated by sky lines. This was found to be due to 
inefficiencies in the initial LAMOST fiber positioning algo¬ 
rithm, which resulted in significant 

Because of various instrumental and calibration problems, 
the absolute wavelength calibration in the commissioning data 
cannot be determined to better than a few angstrom for any 
star. This issue ha s been improved in the pilot sur vey and 
regular survey, e.g., IZhao et al1l2012t ICui et al.1l2012l and the 
typical error of the radial velocity is about 5 km s -1 (Luo et 
al. in prep). 

Despite a few shortcomings in the test observations, the 


TABLE 1 

Summary of the commissioning observation 


FIELD 

RA(2000)' 

DEC(2000) 

N_obs 2 

S/N 3 

I 

91.028979 

23.264102 

16 

14.39 

n 

124.75370 

56.254898 

11 

9.08 

hi 

18.136639 

45.335527 

7 

11.42 

IV 

42.884911 

35.057388 

5 

12.77 

V 

11.162321 

40.679614 

5 

9.09 

VI 

124.40772 

0.477676 

4 

13.94 


1 Center point of celestial coordinates for the field. 
- Observed frequency of field. 

3 Mean signal-to-noise ratio of field. 


LAMOST commissioning spectra still produced useful carry 
out some scientific research, such as the dis£overy of 17 new 
planetary nebulae in the ed ge of the M31(IYuanet al.ll2010l) . 
14 new quasars near M3 1 iiHu o et alJl 2Q10h . 8 new quasars 
in the extragalactic field (IWu et al.ll2010allbl). and 9 candidate 
metal-poor stars with [Fe/H] < -1.0 dLi et al.l2010l) . This sug¬ 
gest that the LAMOST commissioning data maybe has a suf¬ 
ficient quality for us to perform the M dwarfs/subdwarfs iden¬ 
tification and classification. 

Table Q] summarizes information on the six fields that were 
observed, including the coordinates of the field centers, the 
number of times each field was surveyed, and the mean signal- 
to-noise ratio for the spectra obtained in the field. The mean 
signal-to-noise ratio is calculated by averaging the photon 
counts from all the science fibers, which include those where 
only sky lines were detected (i.e. fibers targeting stars but for 
which the alignment was incorrect, resulting in only sky being 
detected). The quoted S/N values therefore underestimate the 
true potential of LAMOST. 

3. M DWARF SEARCH AND CLASSIFICATION 
3.1. Spectral Templates 

Formal spectral classification of M dwarfs is, in principle, 
based on a direct comparison with classification standards that 
have been assembled over the years (lMo uldlll976t Bessel! 
ri98l IKirknatrick etakl[19911 [1995 : Bocha nskiet al] 120071) . 
Formal sets of primary and secondary classification standards 
for the M subdwarf (i.e. metal-poor) sequences have also been 
proposed in ILenine. Rich, & Sharal !2007l) . Visual classifica¬ 
tion is however not practical for large spectroscopic surveys 
where thousands of spectra are collected every night. 

Automated classification methods have been developed 
over the years based on measures of the strength of 
TiO, CaH, and VO bands, which dominate the optical 
spectra of M dwarfs. Spectra l indices were introduced 
by iReid, Hawlev. & Gizisl (119951) and ILenine. Rich, & Sharal 
(120031120071) and relationships between values of theses in¬ 
dices and spectral subtypes have been calibrated, such that 
subtypes can in principle be estimated from spectral index 
measurements alone; a process that can be automated. A 
drawback of the technique is that artifacts in the spectra (sky 
lines, cosmic rays, calibration errors) can result in significant 
errors in the spectral subtype estimates. In addition, differ¬ 
ences in spectral resolution and spectrophotometric calibra¬ 
tion from different telescopes/instruments can result in sys¬ 
tematic errors in the measurement of the spe ctral indices, 
which require careful recalibrations (iLepine et alj|2013l) . 

To help in the visual verification of automated spectral clas¬ 
sification results, a specialized software was developed for the 
Sloan Digital Sky Survey, called "The Hammer" (ICovev et alJ 
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Fig. 1.— Sequence of synthetic classification templates assembled from SDSS spectra. Each of the four sets above shows a temperature sequence, with MO.O 
stars being the warmest and M8.0 stars the coolest in the sequence. The four classes (dwarfs:dM; subdwarfs:sdM; extreme subdwarfs:esdM; and ultrasubd- 
wars:usdM) allegedly represent a metallicity sequence, with the dwarfs (dM) being the most metal rich and the ultrasubdwarfs (usdM) the most metal-poor. 
Late-type templates are noisier due to the smaller number and higher signal-to-noise of the SDSS spectra that were co-added to generate the templates. 
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120071) . The Hammer performs spectral classification using a 
set of spectral indices which assign a tentative subtype. Spec¬ 
tra can then be displayed on a graphical interface and allow 
a user to identify errors and correct the spectral typing from 
visual inspection. Using this method, [West et all (1201 li) vi¬ 
sually inspected over 70,000 M dwarfs from the SDSS DR7. 
The method still requires considerable investment of human 
resources however, but was applied with some success in 
a search for late-type stellar spectra in LAMOST datasets 
(lYi et alJl2014lt . 

As an alternative to these classification methods, we are 
proposing to develop a procedure for automated classifica¬ 
tion which provides reliable spectral subtypes without prior 
measurement of spectral indices, and without the need for vi¬ 
sually inspecting every spectrum. Our method rather relies 
on direct fits to spectral classification templates assembled by 
combining spectra of M dwarfs and M subdwarfs identified 
in the Sloan Digital Sky Survey. Our general philosophy is to 
identify and classify M dwarfs using all the available spectro¬ 
scopic information within the most relevant wavelength range 
for these stars, by performing a fit to identify the template 
which best matches the data. 

Our templates were assembled from SDSS spectra of rela¬ 
tively bright (r< 18) late-K and M dwarfs, most drawn from the 
su bset of M dwarfs r eleased in the SDSS DR7, an d presented 
in IWest et alJ (1201 ll) . In the catalog presented by IWest et al.1 
(1201 ll) . the M dwarfs were classified using the Hammer code, 
with subtypes listed to the nearest integer; instead of using 
these subtypes, we re-classified all the stars based on the spec¬ 
tral index method described in iLepine, Rich, & Sharal (120031 
120071) . which is based on measurements of the three spec¬ 
tral band indices Ti05, CaH2, and CaH3, which measure the 
strengths of the TiO and CaH molecular bands near 7000A. In 
a first pass, the spectral indices were measured, the stars clas¬ 
sified, and spectra were co-added to produce an initial set of 
dM/sdM/esdM/usdM classification templates. Atomic lines 
from the Ca II triplet (A= 8498, 8542, 8662A) were measured 
on the templates to determine any systematic radial velocity 
shift, and the templates were shifted back to the stellar rest 
frame. In a second pass, each SDSS spectrum was cross- 
correlated with its matching template, and the radial velocity 
shift for each star was measured (generally to a precision of 
+/- 10 km/s). Spectral indices were then re-measured for each 
star shifted back in the local rest frame, and each star was re¬ 
classified. A new set of classification templates was then gen¬ 
erated. This procedure was repeated for a third pass. After 
the third pass, the final co-adds were selected as formal clas¬ 
sification templates; the final number of SDSS spectra used to 
define each classification template is shown in Table[2] 

A subset of the final templates is presented in Figures Q]and 
[2] The resulting templates span the spectral subtypes K7.0- 
M8.5, and cover every half-subtype in those ranges. These 
initial templates are arranged in a two-dimensional grid, with 
one axis measuring the general strength of the molecular 
bands and the other axis measuring the ratio of the TiO and 
CaH band strengths. The first axis (spectral subtype) is pre¬ 
sumed to be mainly correlated with a star’s effective temper¬ 
ature. The second axis, on the other hand, is presumed to be 
mainly correlated with a star’s metallicity, although this need 
not be strictly the case; for instance, the ratio of TiO to CaH 
can also be a function of surface gravity. In any case, we 
will assume that our classification system essentially repre¬ 
sents a "temperature-metallicity" grid. With over 18 elements 


TABLE 2 

Number of SDSS spectra combined to make the 

CLASSIFICATION TEMPLATES 



d... 

sd... 

esd... 

usd... 

...K7.0 

1,755 

12 

ii 

13 

...K7.5 

1,870 

12 

16 

14 

...MO.O 

1,651 

17 

20 

29 

...MO.5 

785 

14 

18 

10 

...M1.0 

688 

22 

20 

17 

...Ml.5 

843 

17 

25 

17 

...M2.0 

601 

18 

20 

11 

...M2.5 

711 

21 

16 

8 

...M3.0 

299 

41 

24 

8 

...M3.5 

813 

27 

20 

8 

...M4.0 

813 

21 

6 

8 

...M4.5 

643 

6 

7 

2 

...M5.0 

643 

4 

2 

5 

...M5.5 

778 

4 

3 

2 

...M6.0 

478 

2 

2 

2 

...M6.5 

26 

1 

5 

1 

...M7.0 

31 

1 

2 

1 

...M7.5 

29 

2 

2 

1 

...M8.0 

20 

2 

2 


...M8.5 

16 

2 




of resolution in "temperature", and 4 elements of resolution in 
"metallicity". (We note that the term "metallicity" is only used 
here to suggest that the variations in TiO to CaH band strength 
ratios are most likely due to variations in a star’s chemical 
composition. It is clear that these specific ratios may not be 
strictly correlated with the abundance of iron, so our use of 
the word "metallicity" here is not meant to suggest that val¬ 
ues of [Fe/H] can be inferred from those grids.) The limited 
resolution along the "metallicity" axis is due in part to the dif¬ 
ficulty in measuring accurate metallicity values in individual 
stars using the Ti05 and CaH2+CaH3 spectral indices alone, 
combined with the limited signal-to-noise ratio in the indi¬ 
vidual stars. Our co-added templates however provide signifi¬ 
cantly higher signal-to-noise, which make it possible to define 
a finer "metallicity" scale. 

To increase the number of metallicity grid points from 4 to 
12, we used the following procedure. For any one of the four 
original metallicity classes (i.e., dM), each can be expanded 
to three subclasses: a relatively more metal-rich subclass (i.e., 
dMr), a standard subclass (i.e. dMs), and a relatively more 
metal-poor subclass (i.e., dMp). Assuming that the metallic¬ 
ity gr id is a linear system (see more details in iMann et afl 
120121) , the extra synthetic templates can be created by extrap¬ 
olating between two neighboring classes. For example, we 
can synthesize dMp templates by considering them to be one 
third of the way from the dM standard class to the sdM stan¬ 
dard class: 

[sdMs] — [dMs] 2 1 

[dMp] = --y-t-- + [dMs] = - [dMs] + - [sdMs] (1) 

where dMs and sdMs are the original template spectra of the 
dwarfs and subdwarfs. We can thus interpolate between dMs 
and sdMs to create dMp and sdMr subclass templates, be¬ 
tween sdMs and esdMs to create sdMp and esdMr subclass 
templates, and between esdMs and usdMs to create esdMp 
and usdMr subclass templates. To create the dMr and usdMp 
templates, we can extrapolate them from the original tem¬ 
plates assuming a linear progression at the edges. To make 
the dMr template, we can assume that the dMs template is be¬ 
tween dMp and dMr, which means dMs = ([dMp] + [dMr]) /2, 
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Fig. 2.— Subsets of classification templates for stars of similar subtypes but different metallicities. We define twelves different metallicity subclasses, all 
shown here. These range from dMr to usdMp, from the most metal-rich to the most metal poor. Four of the temperature subtypes (0.0, 2.5, 5.0, 7.5) are selected 
to illustrate the difference of metallicity classes. The late subtype templates (7.5) are noisier because of the smaller number and lower S/N of the SDSS spectra 
that were combined to generate them. 
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and using the Eq. [0 the dMr template can be derived: 

4 1 

[dMr] = 2 [dMs] - [ dMp] = — [dMs] — — [sdMs] (2) 

And the usdMp subclass template can be similarly synthe¬ 
sized by the same extrapolation from the other edge. 

To verify the validity of our synthetic method, the distri¬ 
bution of Ti05 and CaH2+CaH3 spectral indices from all 
the synthetic templates is shown in Figure [3 Four different 
colors represent the four original metallicity classes, which 
are dwarfs (blue), subdwarfs (yellow), extreme subdwarfs 
(green), and ultra-subdwarfs (red), from left to right on the 
plot, respectively. For each metallicity class color, the mid¬ 
dle line is the spectral index distribution of the correspond¬ 
ing original template sequence, while the leftmost line corre¬ 
sponds to the relatively more "metal-rich" subclass sequence 
(r), and the rightmost line corresponds to the relatively more 
"metal-pool" subclass sequence (p). The three black lines are 
the boundaries proposed by iFe pine et all (120131) to separate 
the four original metallicity classes. This shows that our syn¬ 
thetic templates not only define a finer "metallicity" grid but 
also still follow the original classification criteria within each 
metallicity subclass. However, for later subtypes (dM7.0- 
dM8.5, sdM7.0-sdM8.5), the relative rarity of SDSS spec¬ 
tra having appropriate "temperature" and "metallicity" val¬ 
ues results in the templates being dominated by the spectra 
of just a few stars, which can skew the mean "metallicity" 
and "temperature" of the subsample, due to fluctuations from 
small number statistics. On the other hand, the low S/N ra¬ 
tio of late-type template spectra also leads to larger errors in 
the measurement of spectral indices and limits their value as 
a subtype/metallicity test (Figure [3}. Therefore, metallicity 
class estimates at the latest subtypes based on these templates 
should be used with caution. 

We emphasize here again that we use the terms "tempera¬ 
ture" and "metallicity" in our classification system as an ed¬ 
ucated suggestion for what we believe underlies the two di¬ 
mensional classification grid. These are not meant to suggest 
that each grid point represents a unique combination of the 
physical parameters Teff and [Fe/H]. What this means is that 
this proposed classification system should be used for classi¬ 
fication purposes, and not as a means to determine physical 
parameters for a star. The determination of physical parame¬ 
ters from this classification would first require a validation and 
calibration of the grid. This is in fact an ongoing area of re¬ 
search in the field of c ool stars which has seen s o me progress 
in recent yea r , see e.g.iRoias-A vala et al] (120121 ) : iMann et all 
(120 13L 120141) : iNewton et al.l (12015k . It should be understood 
in the following section that the term "temperature" is used in 
the sense of "spectral subtype", the first axis in our classifica¬ 
tion grid, and that the term "metallicity" is used to described 
the second axis in our classification system, which really de¬ 
scribes molecular band ratios. 

3.2. Method for template fitting 

We perform our spectral classification by calculating the 
chi-square value between any one FAMOST spectrum and 
each of the template spectra. The spect ra are first reduced 
using the FAMOST 2D pipeline dLuo et al.ll2004l) . which in¬ 
cludes bias subtraction, Hatfield correction, skyline subtrac¬ 
tion, wavelength calibration, and flux calibration (iZhao et al.l 
120121) . For best results, we have determined that the spec¬ 
tral fitting should avoid the overlap region between the blue- 
channel and red-channel spectra, which is between 5700Aand 
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Fig. 3.— Distribution of the Ti05 and CaH2+CaH3 spectral indices in the 
classification templates. The four colors represent the four original metallic¬ 
ity classes: dwarfs (blue), subdwarfs (yellow), extreme subdwarfs (green), 
and ultrasubdwarfs (red). Our templates further subdivide each metallicity 
class into three subclasses: relatively more metal-rich subclass(Mr), stan¬ 
dard subclass(Ms), and relatively more metal-poor subclass(Mp), from left 
to right on the plot, respectively. Indices for the three subclasses are shown. 
Segments connect the index values of templates from the different spectral 
subtypes within each of the metallicity sub classes. The boundaries of the 
four metallicity subclasses, as proposed by ILenine et alJ d2013f) are shown 
as black lines. Although ou r templates were originally assembled by the 
ILepine, Rich, & Sharal 420071) calibration, which makes a small offset be¬ 
tween a few early-type synthetic templates and the boundary lines, the dif¬ 
ferences were not significant. Overall, our synthetic templates define a finer 
metallicity grid, while still following the original subclass separation. 




Fig. 4.— Distribution of estimated mean S/N ratio in the LAMOST spectra 
for 2,612 M dwarf s/subdwarfs. The sub-panel display the mean S/N distribu¬ 
tion in the range 0 <S/N< 10. 


5900A( IWu et al.li201 1), because of noise and flux calibration 
issues. The spectral fit should also avoid the wavelength range 
between 8000A - 9000Awhere serious skyline contamination 
occurs. The spectral wavelength range used for the classifica¬ 
tion fit is thus limited to 6000A-8000A. Bad pixels, flagged 
by the FAMOST reduction pipeline, are also excluded from 
the fit. The chi-square for the jth template spectrum is defined 
as: 



where L, is the FAMOST spectral flux for each of the i= 1 ,N 
pixels, Tjj is the flux of the /'th template spectrum for every 
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Dote=20091114 Plate = 02bright_01 Spectroscopy=5 Fiber=198 Ra = 16.537870 Dec=44.493912 S/N = 25.3 



6000 6500 7000 7500 8000 6500 7000 7500 8000 

Wavelength (A) Wavelength (A) 


Fig. 5.— Quality of the fit for a representative high S/N spectrum in the LAMOST commissioning data, identified to be an M3.5 dwarf by our template-fitting 
method. Top panel : the full spectrum, which is a composite of the LAMOST blue channel and red channel spectra. Bottom panels : the smoothed LAMOST 
spectrum (black) overlaid on classification templates selected to demonstrate the quality of the fit (red). Bottom left : Three adjacent templates of the same subtype 
but different metallicity subclasses. The middle one is the best fitting template; the one above is the relatively more metal-poor template, and the one below 
is the relatively more metal-rich template, as labeled. The difference between top/bottom templates and the LAMOST spectrum is clear in the 6000A-6200A, 
6900A-7100A(CaH and TiO band) and 7400A-7600A(VO band) ranges. Bottom right : Three adjacent templates of the same metallicity subclass but different 
spectral subtypes. The middle one is the best fitting template, while those above/below represent templates of higher/lower temperatures in the sequence. The 
most significant difference is in the 6000A- 6200Arange. 


corresponding pixel i, and er, is the error for the LAMOST 
spectral flux at each pixel i. N is equal to 1250 because there 
are 1250 pixels between 6000Aand 8000A. 

In order to correct for radial velocity shifts, and reduce 
the possible effects of faulty wavelength calibration (See §2 
above), we have to makesure that the stars are classified in 
their local rest frame as much as possible. We therefore shift 
each LAMOST spectrum towards the blue or red by an integer 
number of pixels, and recalculate the chi-square value against 
each template spectrum. For each LAMOST spectrum, we 
allow for a possible maximum shift of ±8 pixels, which cor¬ 
responds to maximal radial velocity shifts of approximately 
±550 km/s. The factor combination of pixel shifting and /'th 
template spectrum which gives the minimum chi-square value 
is selected as the best fit. 

Stars are rejected when their best fit is a match to a non-M 
dwarf template. For each of the 12 metallicity subclasses, 
there are two earlier template spectra (called non-M dwarf 
templates) corresponding to K5.0 and K5.5 dwarf/subdwarf 
subtypes; these templates have extremely weak molecular fea¬ 
tures of titanium oxide (TiO), vanadium oxide (VO) and cal¬ 
cium hydride (CaH). For stellar types earlier than K7.0, these 
molecular bands are indeed so weak as to be generally undis- 
cernible, except in spectra with very high S/N, and the spectra 
essentially all look relatively flat in the 6000A- 8000Arange 
dReid, Hawlev. & Gizisll 19951) . As a result, any star with a 
subtype earlier than K7 (including all F and G stars) will have 
its best match with any one of the K5.0 or K5.5 templates. 
Any star with a best match to those templates is therefore 
identified as a non-M dwarf, and rejected. 

Since the quantum efficiency for CCDs in LAMOST is in¬ 
creasing over the wavelength range from 6000Ato 8000A, the 
measurement errors for each pixel have a systematically de¬ 
creasing trend over this wavelength range. So when we use 
the LAMOST spectrum’s chi-square fit to the template, the 


red part close to 8000Ahas more weight than the blue part 
closer to 6000A. On the other hand, most of the prominent 
molecular absorption features are in this bluer part. This 
means that some non-M dwarf stars which happen to have 
similar spectra on the red side around 8000Amight be clas¬ 
sified as M dwarfs, even if there is no good fit on the blue 
side. This is more common for low S/N spectra, because their 
blue edge is dominated by the instrumental noise, and their 
red edge is as featureless as most of the templates, and thus 
they appear more similar to the early-M dwarf templates than 
to the late-K ones. 

To solve this problem, we use an additional criterion to 
identify and reject non-M dwarfs with noisy spectra that 
might otherwise be classified as M dwarfs from the chi-square 
fit. In addition to the chi-square value, we also calculate 
the integrated square of the difference between the LAMOST 
spectrum and the best classification template. If the integrated 
square difference is greater than 130 pixel units, then the star 
is considered a non-conclusive classification and is excluded 
from the final M dwarf list. Specifically, this means that we 
allow for a maximum difference between a LAMOST spec¬ 
trum and a classification template of 10% for each pixel on 
average ; for the normalized spectra, the corresponding differ¬ 
ence is 0.1 units. Because there are about 1250 pixels used in 
the fit, the threshold value in the fitting area is about 130 pixel 
units. Our experience shows that this procedure eliminates the 
vast majority of noisy, but clearly non-M dwarf stars that may 
otherwise be identified as M dwarfs from the chi-square fit. 

3.3. Additional Spectrophotometric Correction 

We find that to obtain the best results with our template 
fit methods, both the LAMOST spectra and the classification 
templates also need to be flux-normalized before the fit, i.e. 
the spectra and templates need to be rectified with respect to 
a pseudo-continuum level. This ensures that the classification 
is primarily based on the absolute and relative strength of the 
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Date-20091019 Plate-03faint Spectroscopy-3 Fiber-98 Ro-9.4752510 Dec-40.572015 S/N-9.5 



Fig. 6.— Quality of the fit for a representative LAMOST spectrum of intermediate S/N. The star happens to be classified as sdMr2.5 by our pipeline. Panels 
are the same as in Figure[5] except that additional templates extending ±1 metallicity subclasses and ±1.0 spectral subtypes from the best fit are also shown for 
comparison. 


molecular bands, and not on the slope of the spectral energy 
distribution, which can be affected by many factors such as 
errors in instrumental flux calibration, aperture losses com¬ 
bined with atmospheric differential refraction, or interstellar 
reddening. 

Considering that the molecular bands completely dominate 
the spectral energy distribution within the fitting area, it is 
hard to objectively define a continuum for rectification. We 
therefore use a more simple flux recalibration that plays a sim¬ 
ilar role and performs a first order correction of the slope. 

For each spectrum, we divide the spectral fitting range in 
two parts, and calculate the integrated flux within each of 
these two regions. Then the straight line y = ax+b that fits 
the two median flux points is used to calibrate the flux and 
the slope of this spectrum. The parameters a and b are given 
by: a = {F r —Ff)/(\ r - Ab), b = F^-aX^, where F T is the median 
flux of the red section (7000A-8000A), /- h is the median flux 
of the blue section (6000A-7000A), and A r , Ab are the median 
wavelengths within each of the two regions. The same flux 
recalibration is performed on all the classification templates, 
and the chi-square best fit is performed for the normalized 
spectrum as it is fit on the normalized templates. 

To evaluate the magnitude of this flux correction, we first 
define R as the ratio of the spectral fluxes, R = Ff /- h , where F r , 
F\, are the median fluxes which have been defined before. We 
then introduce fc as the ratio of the red/blue flux ratio R for 
the original LAMOST spectrum (i.e. before flux correction) 


and the red/blue flux ratio R for the classification template 
which provides the best fit to the data (with the fit performed 
after flux recalibration). This factor of the flux correction fc 
is thus defined as: 


fc = 


^LAMOST 

Rsdss 


(4) 


where Rlamost is the spectral flux ratio of the LAMOST 
spectrum and Rsdss is the flux ratio of the classification tem¬ 
plate. Using this flux correction factor, we can verify whether 
the LAMOST flux calibration and SDSS flux calibration are 
similar {fc = 1), indicating that the LAMOST spectrum has 
the same measured spectral energy distribution as an SDSS 
spectrum of the same spectral subtype. A value different from 
fc = 1 would indicate that star has a spectral energy distribu¬ 
tion that is different than that of a classification standard star 
of the same spectral type. The value of fc can thus be used, 
e.g., to diagnose reddening in a star, which would produce a 
value of fc < 1. 

Again, the final template fit is performed after these addi¬ 
tional flux corrections. Stars which are found to match one 
of the templates are then automatically assigned a metallicity 
class (dMr, dMs, dMp, sdMr, sdMs, sdMp, esdMr, esdMs, es- 
dMp, usdMr, usdMs, usdMp) and spectral subtype (0.0 to 8.5) 
from the class and subtype of the template which provided the 
best fit. We also include stars classified with subtypes K7.0 
and K7.5, and of all metallicity subclasses. 
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Date-20091122 Plate-6f01 Spectroscopy-1 Fiber-120 Ra-125.58749 Dec-54.372974 S/N-2.9 
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Fig. 7.— Quality of the fit for a typical LAMOST spectrum of low S/N. The star happens to be classified as an extreme subdwarf (esdMpO.O) by our pipeline. 
Because of the serious skylines contamination and the weakness of the molecular bands, it is hard for our template-fit method to classified the spectrum with the 
same accuracy as the higher S/N spectra shown in Figure[5]and Figure[6] In any case, close examination of neighboring templates (above and below the best fit 
template shown in the middle) reveals difference that are significant enough near the 7,000Aregion to be confident of a classification accuracy to within ±1.0 
spectral subtypes and within ±2 metallicity subclasses. 


4. APPLICATION AND VERIFICATION OF THE CLASSIFICATION 
METHOD 

4.1. Search results 

We performed spectral classification of all the spectra col¬ 
lected during the LAMOST commissioning phase, using our 
template-fit method. Of the 83,500 spectra which were passed 
through the M dwarf classification pipeline, 2,612 were pos¬ 
itively identified as M dwarfs/subdwarfs by our classification 
code, 74,573 spectra (89.3%) were excluded as non-M dwarfs 
by the template fitting method, and 6,315 spectra (7.6%) were 
rejected as too noisy for classification, based on our integrated 
square-difference criterion. 

Superficial examination of the spectra from the 2,612 M 
dwarf candidates shows that all of them display TiO and CaH 
bands, which confirms that they are all consistent with M 
dwarf/subdwarf stars. Examination of some of the 80,888 re¬ 
jected spectra, on the other hand, reveals a variety of other 
sources, some looking like stars of earlier spectral types such 
as AFGK stars and white dwarfs, others showing only sky 
lines, or spectra that are too noisy to be identified. In any 
case, none of the rejected spectra are found to show clear ev¬ 
idence of TiO or CaH molecular bands, and are thus not M 
stars. 

4.2. Sample spectra 


To verify the quality of the template fits and how effectively 
M dwarfs/subdwarfs are classified with our pipeline, we per¬ 
formed close visual inspection of a number of spectra cover¬ 
ing a broad range of spectral subtypes (as determined from 
our fit) and with a range of integrated square differences to 
the classification template. 

Since the classification accuracy is mainly influenced by 
the S/N ratio for each spectrum, the S/N distribution for 
2,612 M dwarfs/subdwarfs is shown in Figure [4] Based on 
this histogram, we can divide the M dwarfs/subdwarfs clas¬ 
sified in our pipeline into three subsets: the high S/N spectra 
( S/N > 15), the intermediate S/N spectra (5 < S/N < 15), and 
the low S/N spectra ( S/N < 5). To illustrate the fit quality of 
different S/N subsets, we plot three example spectra and their 
fitting templates in Figures [5] [6) and [7] 

FigureQdisplays the LAMOST high S/N spectrum of a star 
classified as dM3.5 by our pipeline (black line). The spec¬ 
trum is compared to the best-fitting template (red line). The 
best-fitting template (dM3.5) shows a near-perfect fit to the 
LAMOST spectrum, and in particular is an excellent match 
to the main molecular absorption bands (TiO, CaH) between 
6000Aand 8000A. To evaluate the quality of the fit, we also 
compare this LAMOST spectrum with the four adjacent tem¬ 
plates in the spectral subtype / metallicity class grid, which 
are overlaid on the LAMOST spectrum. There are subtle 
but clear differences between each of these adjacent tem- 
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FIG. 8.— Distribution of the spectral index measuring the bandhead of the TiO band redward of 7040A(TiO5) and the sum of the indices measuring the strength 
of the CaH bandhead blueward of 7040A(CaH2+CaH3). Different symbols are used for stars classified in the LAMOST classification pipeline as dwarfs (dM), 
subdwarfs (sdM), extreme subdwarfs (esdM), and late-K dwarfs. As expected, many of the subdwarfs and extreme subdwarfs fall to the right of the dM dwarfs 
sequence, due to the TiO/CaH band ratio being metallicity dependent. Some stars classified as M subdwarfs have indices more consistent with dM dwarfs; those 
stars tend to have lower S/N and could be misclassified, or suffer significant measurement errors on the spectra l indices. The three solid black lines delimitate the 
loci of the four metallicity classes (dM, sdM, esdM, usdM, left to right) originally proposed by ILcpinc et all 12013 ). Two outliers, classified as dM dwarfs but 
falling outside the normal distribution of dM stars in this diagram (open blue triangle and diamond symbols), are investigated in Figure[9] 


plates and the LAMOST spectrum, which demonstrate that 
the class/subtype dM3.5 provides the best possible classifica¬ 
tion for the star. This suggest that for high-quality data, our 
classification method is indeed accurate at least to the nearest 
half-subtype, and to the nearest metallicity subclass. 

Figure [6] shows the case of a LAMOST spectrum of inter¬ 
mediate S/N, which happens to be identified as an M subd¬ 
warf, with a sdMr2.5 classification. Although the differences 
between early-type dwarfs in the temperature-metallicity grid 
are not as clear as for the dMs3.5 star in Figure 0 there are 
still notable differences which indicate that the fit is not as 
good for the adjacent templates. With the differences between 
the LAMOST spectrum and the templates increasing signifi¬ 
cantly for templates beyond those displayed in Figure [6] we 
remain confident that the classification also has an accuracy of 
at least +/-0.5 subtypes, and is accurate to within the nearest 
metallicity subclass. 

Figure [7] shows the low S/N spectrum of a LAMOST star, 
identified as an extreme subdwarf with a classification es- 
dMpO.O according to our code. The weakness of the molec¬ 
ular bands and the lower S/N makes it harder for the star to 
be accurately classified, and there is more uncertainty in the 
assigned spectral type and metallicity class. In any case, there 
are significant differences between the star and templates that 
are two steps away from the assigned subtype/subclass. From 
this we are confident that the classification is still accurate 
to within +/- 1.0 subtypes, and to within ±2 metallicity sub¬ 
classes. 

Our pipeline has classified 2,612 M dwarfs/subdwarfs in 


total. The accuracy for low S/N spectra classification are no 
better than ±1.0 subtype or ±2 metallicity subclasses as de¬ 
fined in this paper, however these low S/N spectra (S/N <5) 
are a minority and only represent 25.8% of the catalog (675 
stars). The remaining 1,937 stars have spectra with sufficient 
S/N ratio (>5) to be accurately classified to within ±0.5 sub- 
type or ±1 metallicity subclass. 

4.3. Measurement of spectral indices 

As an additional verification of our classification method, 
we have calculated sets of spectral indices commonly used 
for M dwarf/subdwarf classification. The Ti05, CaH2, and 
CaH3 spectral indices, as defined in [Reid. Hawley. & Gizisl 
(fl995l) . were calculated for all the LAMOST stars identified 
as M dwarfs/subwarfs by our classification pipeline. 

The distribution of Ti05 against CaH2+CaH3 was orig¬ 
inally used to formally separate metal-poor M subdwarfs 
from metal-r ich M dwarfs (lGizisll997llLepine. Rich. & Sharal 
120031 120071) . We plot this distribution for all the LAM¬ 
OST M dwarfs/subdwarfs in Figure [8] using different sym¬ 
bols/colors for stars of different metallicity classes. We note 
that the distribution's in good agreement with Figure 3 of 
iLepine. Rich. & Shara ( 2007 ). and confirms that most of the 
LAMOST stars are relatively metal-rich M dwarfs and late- 
K dwarfs. The LAMOST stars classified as subdwarfs gen¬ 
erally have higher Ti05 index value for a given range of 
CaH2+CaH3 values, also in general agreement with the spec¬ 
tral index classification system. 

However, there are several outliers, e.g., stars classified as 
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Date=20091117 Plate=4b2 Spectroscopy= 16 Fiber=117 Ra=42.047108 Dec=36.833923 S/N=16.9 




Date=20091020 Plate=02faint Spectroscopy=6 Fiber=83 

Ra = 20.702248 Dec=45.028671 S/N = 0.9 
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Fig. 9.— Spectra of the two outlier stars marked in Figure [8] Top : spectrum of the outlier identified as an open triangle symbol. The weaker molecular bands 
of CaH, consistent with the location in the TiO/CaH diagram of Figure [8] indicate that it is most probably an M giant instead of an M dwarf. Bottom : spectrum of 
the outlier represented by an open diamond symbol. Because of the low S/N ratio, the skylines occurring in the CaH molecular bands result in significant errors 
in the spectral index measurements. The template-fit classification appears to be quite good however. 



Fig. 10.— Spectral index distribution of dM dwarfs for the three different metallicity subclasses defined in this paper. The different loci of the dMp, dMs, and 
dMr subclasses clearly show that the metallicity subclasses defined in our spectral fitting method also consistently separate out the stars in the spectral index grid, 
according the expected trends for metallicity variations. 


dM by our pipeline but with a [TiO,CaH2+CaH3] index more 
consistent with an sdM subtype. We carefully examine two of 
these to understand the discrepancy. The two stars represent 
two different kinds of offset along the dM sequence. The top 
panel of Figure [9] displays the spectrum of the star identified 
as an open triangle symbol in Figure [8] The difference in the 
molecular absorption bands between the star and any of the 
dM templates suggests that it is probably an M giant instead of 
M dwarf; the weaker molecular bands of CaH are consistent 
with the M giant spectra analyzed by iMann et al.l (120121) . In 


this particular example, it is our classification pipeline that is 
in error, and the TiO/CaH spectral index ratio more correctly 
identifies the star as M giant. It is hard for our automated 
classification pipeline to exclude these M giant stars with the 
template-fit method, because the dwarf templates always pro¬ 
vide the best fit to the spectra and the molecular bands are 
similar. Our best option would be to use M giant templates 
in addition to the M dwarfs/subdwarfs. Future efforts will be 
devoted to adding M giant templates to the pipeline. 

The second outlier, represented by a diamond symbol in 
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Fig. 11.— Distribution of the flux correction factor fc, which measure 
whether the spectral energy distribution of a LAMOST M dwarf looks redder 
(fc < 1) or bluer (fc >1) than the adopted SDSS-based classification tem¬ 
plates. The distribution suggests that the flux calibration is generally good for 
most of the LAMOST commissioning spectra, albeit redder than expected on 
average. To investigate the possibility of interstellar reddening, we divide the 
stars in two groups based on their spectroscopic distance. The closer stars 
tend to have relatively fewer outliers but still look marginally redder than ex¬ 
pected, on average, which points to differences in the LAMOST and SDSS 
flux calibrations. 

Figure [8] is plotted in the bottom panels of Figure|9] Here the 
source of the offset is found to be due to instrumental factors. 
The low S/N ratio and intense skyline contamination in this 
spectrum cause a large error in the measurement of the spec¬ 
tral indices. In this case, it is the spectral index measurements 
that are in error, and our template fit method provides a more 
reliable estimate of the subtype and metallicity class. 

More generally, we find that the spectral-fit method pro¬ 
vides a better estimate of the spectral subtype because it uses 
a much broader wavelength range, whereas the Ti05, CaH2, 
and CaH3 spectral indices are narrowly defined, and thus 
much more sensitive to instrumental noise or spectral reduc¬ 
tion artefacts. We find that large scatter of the sdM stars in 
Figure |8]is thus largely due to errors in the measurement the 
spectral indices from low S/N spectra, not in errors in our 
spectral classification pipeline. We conclude suggests that our 
classification pipeline is more reliable for measuring the spec¬ 
tral type and the metallicity than if we were using the spectral 
indices alone. 

To verify the consistency of our newly introduced metallic¬ 
ity subclasses (r,s,p), we plot the spectral index distribution of 
dM dwarfs for the three different subclasses in Figure[lO] The 
slightly different locations (layering) of the dMp, dMs, and 
dMr stars suggest that the metallicity subclasses defined in our 
spectral fitting method also consistently separate out the stars 
in the spectral index grid following the expected trends for 
metallicity variations, with the more metal-rich stars (dMr) 
having lower Ti05 values for a given CaH2+CaH3, and the 
more metal-poor stars (dMp) having higher Ti05 values. 

4.4. Flux correction 

As we defined in Section 3.3, the flux correction, fc, is 
mostly useful to verify the quality of the LAMOST flux cal¬ 
ibration. Figure QT| shows the distribution of fc values for 
all 2,612 stars classified as M dwarfs/subdwarfs. The distri¬ 
bution suggests that the flux calibration is in general agree¬ 
ment with the SDSS calibration for most of the LAMOST 
commissioning spectra. However, we find a median value of 


TABLE 3 

The absolute magnitude ( Mj ) for spectral 
SUBTYPES (SpTY) 



d... 

sd... 

esd... 

usd... 

...K7.0 

5.600 

6.050 

7.400 

8.550 

...K7.5 

5.800 

6.425 

7.600 

8.675 

...MO.O 

6.000 

6.800 

7.800 

8.800 

...M0.5 

6.200 

7.175 

8.000 

8.925 

...M1.0 

6.400 

7.550 

8.200 

9.050 

...Ml.5 

6.600 

7.925 

8.400 

9.175 

...M2.0 

6.800 

8.300 

8.600 

9.300 

...M2.5 

7.000 

8.675 

8.800 

9.425 

...M3.0 

7.450 

9.050 

9.000 

9.550 

...M3.5 

7.900 

9.425 

9.200 

9.675 

...M4.0 

8.350 

9.800 

9.400 

9.800 

...M4.5 

8.800 

9.950 

9.600 

9.925 

...M5.0 

9.250 

10.100 

9.800 

10.050 

...M5.5 

9.700 

10.250 

10.000 

10.175 

...M6.0 

10.150 

10.400 

10.200 

10.300 

...M6.5 

10.325 

10.550 

10.400 

10.425 

...M7.0 

10.500 

10.700 

10.600 

10.550 

...M7.5 

10.675 

10.850 

10.800 

10.675 

...M8.0 

10.850 

11.000 

11.000 

10.800 

...M8.5 

11.025 

11.150 

11.200 

10.925 

...M9.0 

11.200 

11.300 

11.400 

11.050 

...M9.5 

11.375 

11.450 

11.600 

11.175 


fc < 1, which suggests that the LAMOST spectra have sys¬ 
tematically redder spectral slopes than the SDSS spectra for 
stars of the same spectral subtypes. We suspect this is due to 
minor flux calibration differences between the LAMOST and 
SDSS data reduction pipeline. The tails in the distribution, on 
the other hand, are mainly due to LAMOST spectra with in¬ 
strumental issues or data reduction/calibration shortcomings. 
The left wing (low fc values) comprises stars in which there 
is an over-subtraction of the sky lines, which makes the stars 
appear systematically redder. The right wing of the distri¬ 
bution (high fc values) comprises spectra which suffer from 
a known issue with the blue channel calibration, making the 
stars appear bluer. 

Interstellar reddening might also be affecting the distribu¬ 
tion to some extent, although the effect should be small given 
that most of these M dwarfs are relatively nearby. To examine 
this possibility, we separately plot in FigureQT]the distribution 
of fc values for the relatively nearby (d<200pc) and the more 
distant (d>200pc) M dwarfs in our list (see below). We find 
no significant offset between the two distribution, which sug¬ 
gests that interstellar reddening is negligible within our larger 
catalog, and that most of the fc values correct for instrumen¬ 
tal errors in the spectrophotometry. 

4.5. Infrared color distribution 

One potentially serious source of contamination in any red 
dwarf survey are background red giant stars, which also dis¬ 
play molecular bands of TiO, CaH, and VO, similar to the 
M dwarfs. To identify possible red giants in our sample, 
we examined the 2MASS J-H and H-K s colors, of all the 
stars identified as M dwarfs/subdwarfs by our pipeline. M 
giants occupy a distinct locus in the (J-H, H-K s ) diagram 
feessell &~B rett 1988t lPt>ine. & Sharal120051) : this is due to 
the development of water bands in M dwarfs which depresses 
the flux in the H and K s bands and make the dwarfs appear 
bluer. In particular, M dwarfs always show J-H < 0.7, while 
giants usually have much redder J-H colors. 

The (J-H, H-K s ) distribution of the stars identified as M 
dwarfs/subdwarfs by the LAMOST pipeline is plotted in Fig¬ 
ure [12] Stars are plotted in two separate subsets based on 
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Fig. 12.— Distribution of 2MASS J H and H-K, colors for the stars identified as late-K and M dwarfs/subdwarfs in the LAMOST commissioning data. Left'. 
the 1,080 nearest stars (d < 300 pc): their relative concentration is consistent with unreddened main sequence stars as shows in the infrared color-color diagram 
of lLepine, & Shar'd (2005). This suggest that there is no significant interstellar reddening or red giants contamination. Right', the 1,497 more distant stars 
(d > 300pc). The larger scatter is in part due to stars have fainter magnitudes, and thus larger uncertainty in their infrared colors. The distribution, however, also 
shows redder colors on average. This systematic offset suggests that some stars may suffer some form of interstellar reddening, which would be consistent with 
the larger distance range. 


their spectroscopic distance (see §5 below). The nearest stars 
(d < 200 pc) occupy a relatively concentrated area which is 
remarkably similar to the unre ddened nearby M dwarfs locus 
in the (J-H, H-K s ) diagram by ILenine. & Sharal (120051) . The 
agreement suggest that there is probably no significant inter¬ 
stellar reddening, no significant contamination from red giant 
stars. The distribution of the more distant stars (d > 200 pc), 
on the other hand, in the (J-H, H-K s ) diagram shows signif¬ 
icantly more scatter, and also displays a systematic average 
shift to redder infrared magnitudes, with many stars having 
colors J-H > 0.7. While this could be indicating the pres¬ 
ence of some red giant contaminants, this possibility is not 
supported by the distribution of spectral indices. After care¬ 
ful examination of all the LAMOST spectra for stars with 
J-H > 0.7, we conclude that the vast majority of the stars 
are indeed M dwarfs and not M giants, with the notable ex¬ 
ception of the star shown in Figure[9]and discussed above. 

Instead, we suggest that the systematically redder colors 
may indicate true interstellar reddening at least in some stars. 
This is supported by the fact that the offset affects mostly the 
more distant stars in our list. This reddening would have been 
corrected in our flux renormalization procedure, and go unno¬ 
ticed. In addition, the increased scatter may be explained by 
the fainter infrared magnitudes of the more distant M dwarfs. 
Overall, this indicates that giant star contamination is not a 
significant problem in our M dwarfs/subdwarfs catalog. 


5. DISTANCE AND KINEMATICS ANALYSIS 
5.1. Spectroscopic distances 

Spectroscopic distances for the stars were estimated based 
on the absolute infrared magnitude (MJ) to the spectral 
subtype (SpTy) relationships. These relationships were re¬ 
calibrated using a revised census of nearby M dwarfs with 
both spectral types and parallax distances, which we assem¬ 
bled. We first identified M dwarf and M subdwarf stars with 
parallax measurements documented in iMonet et ID (1992 ): 
lHarrington et al.l(ll993l):lvan Altena et alTi 19951): Henry et alJ 

i - 


2006 );_ van Leeuwenl (120071): Gatewood & CobanTi 2009); 


Lepine et al. (2009): TSmart et alJ (120101) : iKhrutskava et all 


20101: iRiedel et all (1201 llhlJao et alJ (1201 llklDittmann et alJ 
( 2014), with a smaller number of additional stars from Lepine 
et al.(2015, in preparation). Within this sample, we identi¬ 
fied 1459 s tars with known spectral subtypes as determined in 
i Reid, Hawlev. & Gizisl (1 19951): ILenine. Rich, & Sharal (I2003L 
120071) : (West et all d20 111) ; iLe pine et al J (I2013l) ~ with again 
some additional subtypes determinations from Lepine et 
al.(2015, in preparation). The final calibration sample in¬ 
cludes 1374 dM, 47 sdM, 21 esdM, and 17 usdM. The distri¬ 
bution of absolute J magnitudes as a function of the spectral 
subtype is shown in FigureQAj with one panel for each of the 
metallicity classes. For every star, random values of ± 0.1 
subtype was added, to spread out the data for more clarity. 

The absolute magnitude to spectral subtype relationship for 
esdM and usdM stars was determined with a simple fit of a lin¬ 
ear function. For sdM stars, we found a simple linear fit to be 
unsatisfactory due to an apparent inflection point at later sub- 
types, and instead determined a calibration using two linear 
segments. For M dwarfs, even the two-segment fit proved un¬ 
satisfactory, and a three segment calibration was determined 
instead. Those relationships are plotted in Figure[l3] The cor¬ 
responding values for the absolute J magnitude as a function 
of subtype are listed in Table [3] We estimate that this cal¬ 
ibration provides values of MJ with an uncertainty of about 
0.7 magnitudes, yielding spectroscopic distances with an ac¬ 
curacy of about 40%. 

FigureflAlshows the distribution of spectroscopic distances 
for stars in various ranges of spectral subtypes. Early-type 
red dwarfs (K7-M2) are clearly detected over a much larger 
distance range than mid-type objects (M3-M5), which is as 
one should expect from the magnitude limit of the LAMOST 
survey. Only in the first distance bin (d < 1 00pc) are mid¬ 
type M dwarfs significantly more numerous that early-type 
ones. This suggests that the vast majority of M dwarfs that 
will be observed and classified in the LAMOST survey will 
be early-type M dwarfs at least to within about 500pc of the 
Sun. For mid-type M dwarfs, the survey will only probe the 
volume within about lOOpc of the Sun. 
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subtype M subtype sdM 



subtype esdM subtype usdM 


FlO. 13.— Distribution of absolute magnitudes in J band for nearby M stars with both spectral subtypes and parallax distances. A random values of ± 0.1 
subtype was added, to spread out the data for more clarity. Stars from different metallicity classes (dM, sdM, esdM, usdM) are shown in different panels. The 
black lines show the adopted relationships for each metallicity subclass. 



Fig. 14.— Distribution of spectroscopic distances for stars in three spec¬ 
tral subtype ranges (K7-M0, M1-M2, and M3-M5). The early-type dwarfs 
are dominated in our catalog and are sampled over a larger spectroscopic dis¬ 
tances than other late-type dwarfs. This bias is entirely due to magnitude 
selection effects, which are expected because the early-type M dwarfs are 
intrinsically brighter than the late-type M dwarfs. 

5.2. Proper motions 

We obtained proper motions for most of the K/M dwarfs 
by searching the PPMXL catalog of iRoeser et al.l (i2010i) and 
the latest version of the S UPERBLINK proper m otion cata¬ 
log (iLepine, & Sharal2005tlLepine & Gaidosll201 ll) . We used 
the positions of the stars in the LAMOST input catalog to 
search for the positions in PPMXL of the LAMOST K/M 
dwarfs identified in our pipeline, finding matches for 2,458 
stars. Then we used the SUPERBLINK catalog to match the 
rest of the 154 stars, which provided proper motions for an 



Distance (pc) 

Fig. 15.— Distribution of transverse velocities as a function of the spectro¬ 
scopic distance. Mean velocities are calculated for stars in three distance bins 
(0-400 pc, 400-800 pc, 800-1200 pc); the red dots and error bars represent the 
median transverse velocity and the corresponding standard deviation for each 
bin. The mean transverse is largely consistent with the local kinematics of 
disk stars. However, it shows a weak growing trend with spectroscopic dis¬ 
tance, which can be simply explained by the uncertainty in the spectroscopic 
distance estimates. 

additional 30 stars. Proper motions could not be assigned for 
the remaining 124 sources. The proper motion range for K/M 
dwarfs in our catalog is 1 mas yr _I < p< 416 mas yr -1 , while 
the median value of the proper motion is 21 mas yr' 1 . 

These proper motions can be used to validate our spectro- 
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Fig. 16.— Distribution of UVW components of motion for our spectroscopically classified M dwarfs in LAMOST, based on spectroscopic distances and proper 
motion measurements using the method described in §5.5. Each M dwarf contributes only two velocity components, which means it is only shown in one of the 
panels. The (V,W) projection shows the expected asymmetric drift of local disk stars. 


scopic distances, to the extent that the calculated transverse 
motion should be consistent with the known, local kinematics 
of the Galactic disk. The distribution of transverse velocities 
is plotted in Figure [15] as a function of the spectroscopic dis¬ 
tance. We find that most of the stars have transverse motions 
in the 10-100 km s _1 range, which is indeed largely consistent 
with the local disk kinematics. The more distant stars (ac¬ 
cording to our spectroscopic distance measurements) are also 
found to have significantly smaller proper motions, which is 
also as expected. We also do not see any significant number 
of objects with unusually small transverse motions, which is 
what one would expect if background M giants were contam¬ 
inating the sample, because misclassified giants would also 
have significantly underestimated spectroscopic distances. 

We measure the mean value of the transverse motion for 
stars in three bins of spectroscopic distances (d < 400 pc, 
400 pc < d < 800 pc, 800 pc < d < 1200 pc), along with the 
dispersion about the mean. These are plotted in Figure IT5l 
All values are consistent with a mean velocity dispersion of 
~ 40km s -1 , but there is a hint of a monotonic increase of 
the mean transverse velocity with distance. This can be ex¬ 
plained by uncertainties in the spectroscopic distances: stars 
with underestimated distances will also have underestimated 
transverse motion, and stars with overestimated distances will 
also have overestimated transverse motions. Stars in the most 
distant bin have mean transverse motions about 40% larger 
than the stars in the central bin (55 km s -1 compared with 
40 km s -1 ); this suggests that spectroscopic distances may be 
over- or under-estimated by about ±40% as well. This caveat 
should be considered when conducting a kinematical analysis 
of the M dwarfs detected in LAMOST. 

5.3. Distribution of space motions 

We could not obtain radial velocities for the K/M dwarfs 
we identified due to unreliable wavelength calibration of the 
LAMOST commissioning spectra. With the improvement of 
instrument and calibration method in regular survey, the radial 
velocity estimates for K/M dwarfs are expected to be available 
for upcoming data releases. 

While the unavailability of radial velocity measurements 
limits the amount of kinematical data, it is still possible to 
carry out basic kinematics analysis from spectroscopic dis¬ 


tances and proper motions alone. This_can be done using the 
statistical method described in iLepine et al.l (120 131) . in which 
it is assumed that the radial velocity Ry = 0 for all K/M dwarfs, 
but different subsets are extracted to study the velocity dis¬ 
tribution in various projections of the local (U,V,W) velocity 
space. Here we assume U is the component on motion point¬ 
ing in the direction of the Galactic center, V the component 
in the direction of Galactic rotation, and W in the direction 
of the north Galactic pole. The statistical method uses the 
fact that in some specific directions on the sky, it is possible 
to calculate 2 of the 3 components of motion using distances 
and proper motions alone. For example one can measure the 
components U and V from proper motions and distances if 
the stars are in close proximity to the north or south Galac¬ 
tic poles (because the radial velocity contributes almost ex¬ 
clusively to the component W). Likewise the components of 
U and W can be measured from distances and proper mo¬ 
tions alone for stars located in close proximity to the Galac¬ 
tic center and anti-center, and the components of V and W 
can be measured from distances and proper motions alone for 
stars located in close proximity to the apex and antapex of the 
Galactic rotation (/ = 90,Z? = 0; and / = 270,Z? = 0). The ap¬ 
proximation however breaks down for stars that are far from 
any of those six points on the sky, but can be efficiently used 
in large surveys for specific subsets. 

Here we adopt the following approach: for every star in the 
sample, we determine which of the six canonical points (as 
described above) is closest to the star in question, and cal¬ 
culate the two components of motion which can be approxi¬ 
mated with distance and proper motion alone for stars close 
to that point on the sky. Effectively this means that a single 
pair of components of motion (either UV, UW, or VW) is cal¬ 
culated for each star, is every case assuming that the radial 
velocity is zero. Because the LAMOST commissioning data 
only covers fields of low galactic latitudes, this means that all 
stars get either estimates of (U,W) or of (V,W). 

These distributions are shown in Figure [16] It is interesting 
that the (V,W) distribution clearly displays the expected local 
asymmetric drift, with most stars having V < 0km s -1 . The 
(U,W) distribution, one the other hand, suggestively shows a 
non-isotropic structure which could possibly be the signature 
of local streams, although further analysis would be required. 
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ideally with the inclusion of radial velocity measurements, to 
confirm whether this trend is real. 

While this method does not provide complete (U,V,W) 
component for any one of the stars, we still have sufficient 
numbers of stars contributing information to each of the three 
velocity components. Simply, stars with blanked values in 
one component of motion are not used to calculate the sta¬ 
tistical moments of that component. The mean velocity and 
standard deviation for the (U,V,W) components are therefore 
calculated to be: 

< U >=-9.8± 1.4kms -1 ,cry = 35.6±6.1 kms -1 , 

< V >= -22.8 ± 0.7 kms -1 ,oy = 30.6 ±2.0 kms -1 , 

< W >= -7.9 ±0.5 kms -1 ,(j^ = 28.4 ±3.3 kms -1 . 

Where the uncertainties are estimated by bootstrap method. 

Comparing with other reported results, such as 
[Hawlev et all (fl996h for the PMSU survey. iFuchs et al.1 
(2009) for the SDSS survey, and Leni neet al.l (1201 3h for the 
SUPERBLINK survey, our (U,V,W) velocity components 
are general consistent with those values. The agreement 
further suggests that our spectroscopic distance estimates do 
not suffer from systematic error, and are reliable enough to 
conduct kinematic studies. This opens the prospect of using 
large numbers of spectroscopically confirmed M dwarfs 
from LAMOST to perform massive kinematics studies of the 
nearest lkpc. 

6. DESCRIPTION OF THE CATALOG 

The complete catalog of 2,612 M dwarfs/subdwarfs is pro¬ 
vided in Table 0] Columns 1 and 2 catalog list celestial coor¬ 
dinates at the 2000.0 epoch. Columns 3 to 6 list proper mo¬ 
tions along the Right Ascension and Declination, in milliarc- 
seconds per year, along with the measurement errors, when 
available. A flag indicating the source of the proper motion is 
included in column 7. Columns 8 to 13 tabulate the infrared J, 
H, K s magnitudes of the counterparts in the 2MASS survey; 
22 M dwarfs do not have 2MASS magnitude because they are 
too faint to be observed by 2MASS. Spectral subtypes which 
determined by our template-fitting classification code are tab¬ 
ulated in column 14. Based on the spectral subtypes, the esti¬ 
mated spectroscopic distances is listed in column 15. 

7. CONCLUSIONS 

We have successfully tested a template-fitting method to 
automatically identify and classify the late-type K and M 
dwarfs in spectra from the LAMOST survey. As an alter¬ 
native to the classification software ’Hammer’, used to an¬ 
alyze Sloan Digital Sky Survey spectra, our procedure can 
perform more reliable spectral classification without the need 
for time-consuming, visual inspection. Instead of relying 
on the spectral indices measurement, we have assembled a 
set of classification templates by combining spectra of M 
dwarfs/subdwarfs in the Sloan Digital Sky Survey. The well- 
defined templates define a temperature and metallicity grid, 
which reliably classifies the LAMOST stars into subtypes 
(temperature) and metallicity subclasses by the template-fit 
method. The method relies on a spectrophotometric renormal¬ 
ization, which makes the fit less dependent of the observed 
spectral energy distribution of the star, and more dependent 
on the absolute and relative depths of the TiO, CaH, and other 
molecular bands. 


To improve on the classification of M dwarfs/subdwarfs, 
we introduce subdivisions in the "metallicity" class system, 
going from 4 main classes (dM, sdM, esdM, usdM) to 12 
"metallicity" subclasses, with the induction of three subdi¬ 
visions for every metallicity class, labeled “r” for “richer”, 
“p” for “poorer”, and “s” for “standard”. These subdivisions 
are included are suffixes to the existing metallicity classes, 
thus providing a metallicity sequence which runs [dMr, dMs, 
dMp, sdMr, sdMs, sdMp, esdMr, esdMs, esdMp, usdMr, us- 
dMs, usdMp], from the presumably most metal-rich to the 
more metal-poor star. The term "metallicity" is used here in a 
suggestive manner, and should not be understood in the strict 
sense. The "metallicity" axis in our classification system is 
mainly dependent on the TiO to CaH molecular band ratio, 
which is only assumed to be correlated with the star’s metal¬ 
licity. Whether these "metallicity classes" can be used to de¬ 
termine actual [Le/H] values for each star would first require 
a proper calibration and validation of the classification grid, 
which is beyond the scope of this paper. In other words, the 
proposed classification system should be used simply for clas¬ 
sification purposes at this time. 

Using the LAMOST commissioning data acquired in 2009- 
2010, we identified 2,612 M dwarfs/subdwarfs by our classi¬ 
fication pipeline, including 1,603 M dwarfs (dM), 52 metal- 
poor M subdwarfs (sdM), 5 very metal-poor extreme sub¬ 
dwarfs (esdM), and 1 probable ultra metal-pool subdwarf 
(usdM). Our quality controls and close examination of indi¬ 
vidual stars indicate that the typical accuracy for our template- 
fit method spectral classification to ±0.5 subtypes and to 
within 1 metallicity subclass (in the 12-subclass system). A 
complete list of the 2,612 M dwarfs/subdwarfs is provided in 
a table. 

Contamination by background giant stars is shown to be 
negligible in this subset. The distribution of infrared colors 
and the relative strength of TiO and CaH molecular bands 
are inconsistent with significant red giant contamination, al¬ 
though one star is indeed identified (and flagged) as a probable 
giant. 

To demonstrate proof-of-concept for using the LAMOST 
data for kinematics studies, we estimated the spectroscopic 
distances and heliocentric (U,V,W) velocity components to 
perform the kinematic analysis for all M dwarfs/subdwarfs we 
identified. A spectroscopic distance calibration is provided, 
which is based on collated parallax data for M dwarfs and M 
subdwarfs in the literature. Transverse motions are then cal¬ 
culated using proper motion data collected mainly from the 
PPMXL and SUPERBLINK proper motion catalogs. 

This preliminary work therefore demonstrates that future 
LAMOST survey programs hold the potential to identify and 
classify very large numbers of M dwarfs in all parts of the sky, 
and the data collected will be of sufficient quality to obtain 
metallicity and kinematics information on local M dwarfs to 
a distance of at least 500 parsecs. The anticipated data hold 
the promise to considerably expand the statistics of the local 
M dwarf/subdwarf populations. 
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TABLE4 

M DWARFS CATALOG,INCLUDING ASTROMETRY,PHOTOMETRY, SPECTROSCOPIC DISTANCES AND ESTIMATED SUBTYPES 1 


RAJ2000 2 

deg 

DEJ2000 

deg 

pmRA 3 

mas/yr 

pmDE 

mas/yr 

e_pmRA 4 

mas/yr 

e_pmDE 

mas/yr 

tl 5 

J 6 

mag 

e_J 7 

mag 

H 

mag 

e_H 

mag 

K s 

mag 

e_K s 

mag 

subtype 8 

Dist 9 

parsec 

18.561404 

43.167682 

7.4 

-13.9 

3.9 

3.9 

P 

14.23 

0.03 

13.56 

0.03 

13.37 

0.04 

dKp7.5 

485±145 

17.613458 

43.683223 

-0.2 

-7.9 

4.0 

4.0 

P 

15.67 

0.07 

15.00 

0.08 

14.67 

0.09 

dMsl.5 

651 ±195 

16.418509 

43.924841 

-3.6 

-2.9 

4.1 

4.1 

P 

15.79 

0.07 

15.02 

0.06 

14.78 

0.08 

sdMsO.O 

626±188 

9.583018 

40.944141 

14.5 

-16.0 

3.8 

3.8 

P 

12.16 

0.02 

11.47 

0.03 

11.30 

0.02 

sdMsO.O 

117± 35 

16.169814 

43.944604 

-3.3 

-14.4 

5.7 

5.7 

P 

15.96 

0.09 

15.41 

0.09 

15.21 

0.11 

dMsO.O 

982±294 

17.184962 

43.590305 

5.6 

-10.8 

4.0 

4.0 

P 

15.43 

0.05 

14.76 

0.05 

14.50 

0.06 

dMr0.5 

701±210 

8.997519 

41.646745 

-45.5 

-15.5 

4.0 

4.0 

P 

12.56 

0.02 

11.95 

0.02 

11.73 

0.02 

dMsl.5 

155± 46 

11.176808 

41.781084 

-79.4 

-16.2 

4.0 

4.0 

P 

12.22 

0.02 

11.66 

0.02 

11.44 

0.03 

sdMrl.O 

85± 25 

11.422563 

41.869348 

-27.0 

-23.2 

4.0 

4.0 

P 

12.98 

0.02 

12.36 

0.02 

12.19 

0.02 

esdKp7.0 

130± 39 

123.894386 

56.428029 

1.5 

8.8 

4.0 

4.0 

P 

12.78 

0.02 

12.08 

0.02 

11.97 

0.02 

dKs7.5 

248± 74 

124.445228 

56.492803 

12.5 

-98.0 

4.0 

4.0 

P 

11.88 

0.02 

11.28 

0.02 

11.13 

0.02 

dKi'7.0 

180± 54 

18.985212 

44.650124 

-24.4 

-6.9 

3.9 

3.9 

P 

13.12 

0.02 

12.42 

0.02 

12.21 

0.02 

dMs0.5 

242± 72 

17.300816 

47.198761 

-15.9 

-6.3 

4.0 

4.0 

P 

14.41 

0.04 

13.76 

0.04 

13.64 

0.04 

dKr7.0 

579±173 

16.860019 

47.571482 

-12.8 

-7.1 

4.0 

4.0 

P 

13.43 

0.02 

12.74 

0.02 

12.51 

0.02 

dKr7.5 

335±100 

20.048857 

45.172276 

21.0 

-58.0 

8.0 

8.0 

S 

12.72 

0.02 

12.03 

0.03 

11.82 

0.02 

dMsl.O 

183± 55 

18.700087 

43.153988 

191.0 

-198.0 

8.0 

8.0 

s 

13.45 

0.03 

12.95 

0.03 

12.73 

0.03 

dMr0.5 

281± 84 

91.337318 

23.565868 

-2.0 

-59.0 

8.0 

8.0 

s 

12.45 

0.02 

11.76 

0.02 

11.53 

0.02 

dMr3.5 

81± 24 

17.400468 

43.307743 

— 

— 

— 

— 

T 

15.56 

0.06 

14.90 

0.06 

14.53 

0.07 

dKp7.5 

895±268 

18.608758 

46.855343 

— 

— 

— 

— 

T 

12.94 

0.03 

12.29 

0.02 

12.09 

0.02 

sdMr0.5 

142± 42 

11.541000 

41.583000 

— 

— 

— 

— 

L 

— 

— 

— 

— 

— 

— 

dMrO.O 

— 


1 The full version of this table is available in the electronic version of the Astronomical Journal. Twenty lines of the table are printed here to show the general layout. 
^ Celestial coordinates in decimal degree,epoch 2000.0. 

3 Proper motion in RA*cos(DEJ2000). 

4 Mean error in pmRA*cos(DEJ2000). 

3 Flags has the meaning: 

# P = Row of the astrometrical parameters are from the PPMXL catalog. 

# S = Row of the astrometrical parameters are from the SUPERBLINK catalog. 

# T = Coordinates parameters are from 2MASS catalog 

# L = Coordinates parameters are from LAMOST input catalog. 

^ Infrared J, H, and K s magnitudes from the 2MASS catalog JSkrutskie et all20061) . 

7 Mean error of J, H, and K s magnitudes from the 2MASS catalog jSkrutskie et alj2006l) . 

8 Estimated spectral subtype based on the template spectral fit. 

9 Spectroscopic distance base on the absolute magnitude(Mj). 










